Senseval: The CL Research Experience

نویسنده

  • Kenneth C. Litkowski
چکیده

The CL Research Senseval system was the highest performing system among the “Allwords” systems, with an overall fine-grained score of 61.6 percent for precision and 60.5 percent for recall on 98 percent of the 8,448 texts on the revised submission (up by almost 6 and 9 percent from the first). The results were achieved with an almost complete reliance on syntactic behavior, using (1) a robust and fast ATN-style parser producing parse trees with annotations on nodes, (2) DIMAP dictionary creation and maintenance software (after conversion of the Hector dictionary files) to hold dictionary entries, and (3) a strategy for analyzing the parse trees in concert with the dictionary data. Further considerable improvements are possible in the parser, exploitation of the Hector data (and representation of dictionary entries), and the analysis strategy, still with syntactic and collocational data. The Senseval data (the dictionary entries and the corpora) provide an excellent testbed for understanding the sources of failures and for evaluating changes in the CL Research system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Explorations in disambiguation using XML text representation

In SENSEVAL-3, CL Research participated in four tasks: English all-words, English lexical sample, disambiguation of WordNet glosses, and automatic labeling of semantic roles. This participation was performed within the development of CL Research’s Knowledge Management System, which massively tags texts with syntactic, semantic, and discourse characterizations and attributes. This System is full...

متن کامل

Designing a task for SENSEVAL-2

This document provides guidelines for setting up and running a task for SENSEVAL-2. These guidelines are based on experience gathered from SENSEVAL-1, which involved individually organized tasks in English (Kilgarriff and Rosenzweig 2000), French (Segond 2000), and Italian (Calzolari and Corazzari 2000), and on the evaluation protocol proposed by Resnik and Yarowsky (1999). Although Resnik and ...

متن کامل

SENSEVAL-2 The Swedish Framework

In this paper we describe the organisation and results of the SENSEVAL-2 exercise for Swedish. We present some of the experiences we gained by participating as developers and organisers in the exercise. We particularly focus on the choice of the lexical and corpus material, the annotation process, the scoring scheme, the motivations for choosing the lexical-sample branch of the exercise, the pa...

متن کامل

Swedish SENSEVAL, a Developer's Perspective

ny computer programs for automatically determining which sense of a iven context, according to a variety of semantic, defining or other types EVALuation (SENSEVAL) is an open, community-based evaluation Disambiguation (WSD) programs, arranged for a second consecutive exercise is to be able to say which programs and methods perform betwords, or varieties of language, present particular problems ...

متن کامل

UBB system at Senseval-3

It is known that whenever a system’s actions depend on the meaning of the text being processed, disambiguation is beneficial or even necessary. The contest Senseval is an international frame where the research in this important field is validated in an hierarchical manner. In this paper we present our system participating for the first time at Senseval 3 contest on WSD, contest developed in Mar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computers and the Humanities

دوره 34  شماره 

صفحات  -

تاریخ انتشار 2000